Roadmap¶
Complete¶
Storage & Data Model¶
-
Entrydata model with structured metadata and type-specific extensions -
DistilleryStoreprotocol — async storage abstraction enabling backend migration - DuckDB backend with VSS extension and HNSW index (cosine similarity)
- Configurable embedding providers (Jina v3 default, OpenAI adapter)
- Embedding model lock via
_metatable — prevents mixed-model corruption - MCP server with 18 tools over stdio and streamable-HTTP
-
distillery.yamlconfig system with validation
Core Skills¶
-
/distill— session knowledge capture with duplicate detection -
/recall— semantic search with provenance display -
/pour— multi-pass retrieval + structured synthesis with citations -
/bookmark— URL fetch, auto-summarize, store with dedup check -
/minutes— meeting notes with--update(append) and--listmodes - Shared
CONVENTIONS.md— author/project identification, error handling patterns
Classification Pipeline¶
-
ClassificationEngine— LLM prompt-based type assignment with confidence scoring -
DeduplicationChecker— skip/merge/link/create at configurable thresholds -
/classifyskill — classify by ID, batch inbox, review queue triage
Quality & Observability¶
- Implicit retrieval feedback + quality metrics (now via
distillery_metrics(scope="search_quality")) - Stale entry detection —
distillery_staletool - Conflict detection (now via
distillery_find_similar(conflict_check=true)) - Usage metrics dashboard —
distillery_metricstool
Infrastructure¶
- FastMCP 2.x/3.x with
@server.tooldecorators - Hierarchical tag namespace with validation and
distillery_tag_treetool - 12 entry types including
person,project,digest,github,feed -
distillery_type_schemasMCP tool for schema discovery
Team Access¶
- HTTP transport —
distillery-mcp --transport http - GitHub OAuth — team authentication via FastMCP
GitHubProvider - Prefect Horizon deployment (MotherDuck)
- Fly.io deployment with persistent DuckDB on volume
- Namespace taxonomy — hierarchical, validated tag system
Ambient Intelligence¶
-
/radar— interest-driven feed digest with AI source suggestions -
/watch— add/remove/list monitored feed sources -
/tune— adjust relevance thresholds and trust weights - Feed polling architecture —
FeedPollerwith configurable intervals - Source adapters — GitHub events (REST API) and RSS/Atom
- Relevance scoring pipeline — embedding-based cosine similarity
- Interest extractor — mines entries for tags, domains, repos, expertise
- Auto-tagging — source tags (
source/github/owner/repo,source/reddit/sub) and topic tags from KB vocabulary -
distillery retagCLI — backfill tags on existing feed entries
Search¶
- Hybrid BM25 + vector search — DuckDB FTS extension with Reciprocal Rank Fusion (RRF)
- Recency decay — configurable time-weighted scoring (90-day window, 0.5 min weight)
- Graceful degradation — falls back to vector-only if FTS extension unavailable
Onboarding¶
-
/setupskill — MCP connectivity wizard, auto-poll configuration - uvx-first setup —
uvx distillery-mcpas recommended first-time path
Planned¶
New Skills¶
-
/whois— evidence-backed expertise map -
/investigate— deep domain context builder -
/digest— team activity summaries -
/briefing— team knowledge dashboard -
/process— batch classify + digest + stale detection pipeline -
/gh-sync— GitHub issue/PR knowledge tracking
Infrastructure¶
- RRF score normalization — hybrid search scores cluster near 1.0 (#170)
- GitHub event content filtering — skip low-value WatchEvent/ForkEvent (#171)
- Access control — team/private visibility flag (#149)
Deferred¶
- LangGraph evaluation for complex skill orchestration
- CODE pipeline formalization for team workflows
- Web UI or REST API
- Multi-team support and cross-team knowledge sharing
- Re-embedding migration tooling
Technology Stack¶
| Layer | Current | Planned |
|---|---|---|
| Interface | Claude Code skills | Same |
| Transport | stdio + streamable-HTTP | Same |
| Auth | GitHub OAuth (FastMCP) | + multi-team RBAC |
| Storage | DuckDB + VSS + FTS / MotherDuck | Same |
| Search | Hybrid BM25 + vector (RRF) | + score normalization |
| Embeddings | Jina v3 / OpenAI | Same |
| Language | Python 3.11+ | Same |
| Hosting | Local / Fly.io / Prefect Horizon | Same |